21 research outputs found
Converting Neuroimaging Big Data to information: Statistical Frameworks for interpretation of Image Driven Biomarkers and Image Driven Disease Subtyping
Large scale clinical trials and population based research studies collect huge amounts of neuroimaging data. Machine learning classifiers can potentially use these data to train models that diagnose brain related diseases from individual brain scans. In this dissertation we address two distinct challenges that beset a wider adoption of these tools for diagnostic purposes.
The first challenge that besets the neuroimaging based disease classification is the lack of a statistical inference machinery for highlighting brain regions that contribute significantly to the classifier decisions. In this dissertation, we address this challenge by developing an analytic framework for interpreting support vector machine (SVM) models used for neuroimaging based diagnosis of psychiatric disease. To do this we first note that permutation testing using SVM model components provides a reliable inference mechanism for model interpretation. Then we derive our analysis framework by showing that under certain assumptions, the permutation based null distributions associated with SVM model components can be approximated analytically using the data themselves. Inference based on these analytic null distributions is validated on real and simulated data. p-Values computed from our analysis can accurately identify anatomical features that differentiate groups used for classifier training. Since the majority of clinical and research communities are trained in understanding statistical p-values rather than machine learning techniques like the SVM, we hope that this work will lead to a better understanding SVM classifiers and motivate a wider adoption of SVM models for image based diagnosis of psychiatric disease.
A second deficiency of learning based neuroimaging diagnostics is that they implicitly assume that, `a single homogeneous pattern of brain changes drives population wide phenotypic differences\u27. In reality it is more likely that multiple patterns of brain deficits drive the complexities observed in the clinical presentation of most diseases. Understanding this heterogeneity may allow us to build better classifiers for identifying such diseases from individual brain scans. However, analytic tools to explore this heterogeneity are missing. With this in view, we present in this dissertation, a framework for exploring disease heterogeneity using population neuroimaging data. The approach we present first computes difference images by comparing matched cases and controls and then clusters these differences. The cluster centers define a set of deficit patterns that differentiates the two groups. By allowing for more than one pattern of difference between two populations, our framework makes a radical departure from traditional tools used for neuroimaging group analyses. We hope that this leads to a better understanding of the processes that lead to disease and also that it ultimately leads to improved image based disease classifiers
Control-Group Feature Normalization for Multivariate Pattern Analysis Using the Support Vector Machine
Normalization of feature vector values is a common practice in machine learning. Generally, each feature value is standardized to the unit hypercube or by normalizing to zero mean and unit variance. Classification decisions based on support vector machines (SVMs) or by other methods are sensitive to the specific normalization used on the features. In the context of multivariate pattern analysis using neuroimaging data, standardization effectively up- and down-weights features based on their individual variability. Since the standard approach uses the entire data set to guide the normalization it utilizes the total variability of these features. This total variation is inevitably dependent on the amount of marginal separation between groups. Thus, such a normalization may attenuate the separability of the data in high dimensional space. In this work we propose an alternate approach that uses an estimate of the control-group standard deviation to normalize features before training. We also show that control-based normalization provides better interpretation with respect to the estimated multivariate disease pattern and improves the classifier performance in many cases
Addressing Confounding in Predictive Models with an Application to Neuroimaging
Understanding structural changes in the brain that are caused by a particular disease is a major goal of neuroimaging research. Multivariate pattern analysis (MVPA) comprises a collection of tools that can be used to understand complex disease effects across the brain. We discuss several important issues that must be considered when analyzing data from neuroimaging studies using MVPA. In particular, we focus on the consequences of confounding by non-imaging variables such as age and sex on the results of MVPA. After reviewing current practice to address confounding in neuroimaging studies, we propose an alternative approach based on inverse probability weighting. Although the proposed method is motivated by neuroimaging applications, it is broadly applicable to many problems in machine learning and predictive modeling. We demonstrate the advantages of our approach on simulated and real data examples
Recommended from our members
Clinical and surgical management of holocervical spinal cord ependymomas.
Background:Spinal ependymomas are rare tumors of the central nervous system, and those spanning the entire cervical spine are atypical. Here, we present two unusual cases of holocervical (C1-C7) spinal ependymomas. Case Description:Two patients, a 32-year-old female and a 24-year-old male presented with neck pain, motor, and sensory deficits. Sagittal MRI confirmed hypointense lesions on T1 and hyperintense regions on T2 spanning the entire cervical spine. These were accompanied by cystic cavities extending caudally into the thoracic spine and rostrally to the cervicomedullary junction. Both patients underwent gross total resection of these lesions and sustained excellent recoveries. Conclusion:Two holocervical cord intramedullary ependymomas were safely and effectively surgically resected without incurring significant perioperative morbidity
Diagnostic potential of structural neuroimaging for depression from a multi-ethnic community sample
Background
At present, we do not have any biological tests which can
contribute towards a diagnosis of depression. Neuroimaging
measures have shown some potential as biomarkers for
diagnosis. However, participants have generally been from the
same ethnic background while the applicability of a biomarker
would require replication in individuals of diverse ethnicities.
Aims
We sought to examine the diagnostic potential of the structural
neuroanatomy of depression in a sample of a wide ethnic diversity.
Method
Structural magnetic resonance imaging (MRI) scans were
obtained from 23 patients with major depressive disorder in
an acute depressive episode (mean age: 39.8 years) and
20 matched healthy volunteers (mean age: 38.8 years).
Participants were of Asian, African and Caucasian ethnicity
recruited from the general community.
Results
Structural neuroanatomy combining white and grey matter
distinguished patients from controls at the highest accuracy of
81% with the most stable pattern being at around 70%. A
widespread network encompassing frontal, parietal, occipital
and cerebellar regions contributed towards diagnostic
classification.
Conclusions
These findings provide an important step in the
development of potential neuroimaging-based tools for
diagnosis as they demonstrate that the identification of
depression is feasible within a multi-ethnic group from the
community.
Declaration of interests
C.H.Y.F. has held recent research grants from Eli Lilly and
Company and GlaxoSmithKline. L.M. is a former employee and
stockholder of Eli Lilly and Company
Recommended from our members
Deep learning for medical image segmentation { using theIBM TrueNorth Neurosynaptic System
Breast DCE-MRI Kinetic Heterogeneity Tumor Markers: Preliminary Associations With Neoadjuvant Chemotherapy Response
The ability to predict response to neoadjuvant chemotherapy for women diagnosed with breast cancer, either before or early on in treatment, is critical to judicious patient selection and tailoring the treatment regimen. In this paper, we investigate the role of contrast agent kinetic heterogeneity features derived from breast dynamic contrast-enhanced magnetic resonance imaging (DCE-MRI) for predicting treatment response. We propose a set of kinetic statistic descriptors and present preliminary results showing the discriminatory capacity of the proposed descriptors for predicting complete and non-complete responders as assessed from pre-treatment imaging exams. The study population consisted of 15 participants: 8 complete responders and 7 non-complete responders. Using the proposed kinetic features, we trained a leave-one-out logistic regression classifier that performs with an area under the receiver operating characteristic (ROC) curve (AUC) of 0.84 under the ROC. We compare the predictive value of our features against commonly used MRI features including kinetics of the characteristic kinetic curve (CKC), maximum peak enhancement (MPE), hotspot signal enhancement ratio (SER), and longest tumor diameter that give lower AUCs of 0.71, 0.66, 0.64, and 0.54, respectively. Our proposed kinetic statistics thus outperform the conventional kinetic descriptors as well as the classifier using a combination of all the conventional descriptors (i.e., CKC, MPE, SER, and longest diameter), which gives an AUC of 0.74. These findings suggest that heterogeneity-based DCE-MRI kinetic statistics could serve as potential imaging biomarkers for tumor characterization and could be used to improve candidate patient selection even before the start of the neoadjuvant treatment